ABSTRACT



A system and method for determining a root cause of error 
activity in a network is described herein. Root cause analysis 
includes the correlation between reported error activity for 
path, line and section entities along a provisioned channel in 
the network. Root causes can also be identified based upon 
the correlation of simultaneous error activity on various 
signal transport levels. Finally, root cause analysis can 
correlate error activity along various path entities. 

25 Claims, 17 Drawing Sheets

SYSTEM AND METHOD FOR UNREPORTED
ROOT CAUSE ANALYSIS 

CROSS-REFERENCE TO OTHER 
APPLICATIONS 

The following applications of common assignee contain 
some common disclosure: 

U.S. Patent Application entitled "System and Method for 
Identifying the Technique Used for Far-End Performance 
Monitoring of a DS1 at a Customer Service Unit," Ser. 
No. 08/671,028, filed Jun. 25, 1996. 

U.S. Patent Application entitled "System and Method for 
Formatting Performance Data In a Telecommunications 
System," Ser. No. 08/670,905, filed Jun. 26, 1996. 

U.S. Patent Application entitled "System and Method for 
Reported Root Cause Analysis," Ser. No. 08/670,844, 
filed Jun. 28, 1996. 

U.S. Patent Application entitled "Enhanced Correlated Prob- 
lem Alert Signals," Ser. No. 08/670,848, filed Jun. 28, 
1996. 

U.S. Patent Application entitled "Correlated Problem Alert
Signals," Ser. No. 08/673,271, filed Jun. 28, 1996.

U.S. Patent Application entitled "Raw Performance Monitor
Correlated Problem Alert Signals," Ser. No. 08/670,847,
filed Jun. 28, 1996.

U.S. Patent Application entitled "System and Method for
Reported Trouble Isolation," Ser. No. 08/672,812, filed
Jun. 28, 1996.

U.S. Patent Application entitled "System and Method for
Unreported Trouble Isolation," Ser. No. 08/672,513, filed
Jun. 28, 1996.

U.S. Patent Application entitled "System and Method for
Monitoring Point Identification," Ser. No. 08/672,512,
filed Jun. 28, 1996.

U.S. Patent Application entitled "System and Method for
End-to-End Threshold Setting," Ser. No. 08/670,845, filed
Jun. 28, 1996.

U.S. Patent Application entitled "System and Method for 
Monitoring Point Activation," Ser. No. 08/672,356, filed 
Jun. 28, 1996. 

U.S. Patent Application entitled "System and Method for 
Tracking and Monitoring Network Elements," Ser. No. 
08/671,029, filed Jun. 25, 1996. 

The above-listed applications are incorporated herein by 
reference in their entireties. 

BACKGROUND OF THE INVENTION 

1. Field of the Invention 

The present invention relates generally to network man- 
agement systems, and more specifically is directed toward 
the determination of a root cause of error activity at one or 
more signal transport levels. 

2. Related Art 

The present application is a continuation of Application
Ser. No. 08/668,516, filed Jun. 28, 1996, entitled "System and
Method for Unreported Root Cause Analysis."

Telecommunication service providers (e.g., MCI Tele- 
communications Corporation) provide a wide range of ser- 
vices to their customers. These services range from the 
transport of a standard 64 kbit/s voice channel (i.e., DS0 
channel) to the transport of higher rate digital data services 
(e.g., video). Both voice channels and digital data services 
are transported over the network via a hierarchy of digital 
signal transport levels. For example, in a conventional 
digital signal hierarchy 24 DS0 channels are mapped into a 
DS1 channel. In turn, 28 DS1 channels are mapped into a 
DS3 channel. 
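
As a quick check of the mapping ratios given above, the following sketch computes how many DS0 channels are carried by one DS3. The constant and function names are illustrative assumptions and do not appear in the disclosure.

```python
# Mapping ratios taken from the text: 24 DS0 per DS1, 28 DS1 per DS3.
DS0_PER_DS1 = 24
DS1_PER_DS3 = 28
DS0_RATE_KBPS = 64  # standard voice channel rate stated in the text


def ds0_channels_per_ds3() -> int:
    """Number of DS0 voice channels carried by one DS3."""
    return DS0_PER_DS1 * DS1_PER_DS3


def ds3_payload_kbps() -> int:
    """Aggregate DS0 payload in one DS3, excluding framing overhead."""
    return ds0_channels_per_ds3() * DS0_RATE_KBPS
```

Under these ratios a DS3 carries 672 DS0 channels. The payload figure excludes framing and stuffing bits, so it is lower than the DS3 line rate.
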
Routing of these DS1 and DS3 channels within a node of
the network is performed by digital cross-connect systems.
Digital cross-connect systems typically switch the channels
at the DS1 and DS3 signal levels. Transmission of channels
between nodes is typically provided via fiber-optic
transmission systems. Fiber-optic transmission systems can
multiplex a plurality of DS3 channels into a higher rate
transmission over a single pair of fibers. In one example, signal
formats for the fiber-optic transmission systems are defined
by the manufacturer. These proprietary systems are referred
to as asynchronous transmission systems.

Alternatively, a fiber-optic transmission system can
implement the synchronous optical network (SONET) standard.
The SONET standard defines a synchronous transport
signal (STS) frame structure that includes overhead bytes
and a synchronous payload envelope (SPE). One or more
channels (e.g., DS1 and DS3 channels) can be mapped into
an SPE. For example, a single DS3 channel can be mapped
into an STS-1 frame. Alternatively, 28 DS1 channels can be
mapped into virtual tributaries (VTs) within the STS-1
frame.

Various STS-1 frames can be concatenated to produce
higher rate SONET signals. For example, an STS-12 signal
includes 12 STS-1 frames, while an STS-48 signal includes
48 STS-1 frames. Finally, after an STS signal is converted
from electrical to optical, it is known as an optical carrier
(OC) signal (e.g., OC-12 and OC-48).

An end-to-end path of a provisioned channel within a
network typically traverses a plurality of nodes. This
provisioned channel is carried over transmission facilities that
operate at various rates in the digital signal hierarchy. For
example, a provisioned DS1 channel may exist as part of a
DS3, VT1.5, STS-1, STS-12, OC-12, and OC-48 signal
along parts of the end-to-end path. This results from the
multiplexing and demultiplexing functions at each of the
nodes.

One of the goals of a network management system is to
monitor the performance of the provisioned channel.
Performance of the provisioned channel can include various
measures. One measure is the unavailability of the
provisioned channel. Unavailability is generally defined as the
amount (or fraction) of time that a channel is not operational.
Various causes such as cable cuts can lead to channel
downtime. Network responses to channel downtime can
include automatic protection switching or various restoration
procedures (e.g., digital cross-connect distributed
restoration).

Although unavailability is a major performance measure 
from a customer's standpoint, other performance measures
can also be critical. For example, if a customer desires a 
digital data service for the transmission of financial data, the 
number of errored seconds or severely errored seconds may 
be a concern. 

In conventional network management systems, performance
monitoring is accomplished in piecewise fashion. For
example, consider a provisioned channel that traverses an
end-to-end path comprising asynchronous transmission systems
and SONET transmission systems. Performance monitoring
information for these two types of transmission
systems is typically maintained in separate databases.
Moreover, the various types of transmission systems may be
provided by multiple vendors. Each of these vendors may
define its own separate performance monitoring process.
For example, the vendor-controlled process may define the
types of data that are retrieved from or reported by the
individual network elements.

In this environment, comprehensive performance monitoring
analysis is difficult to accomplish. What is needed is
a network management system that can monitor provisioned 
channels at various points of the end-to-end path and iden- 
tify the root cause of problems that lead to observable error 
activity. This capability allows a service provider to effi- 
ciently resolve problems that lead to degradation of network 
performance. 

SUMMARY OF THE INVENTION 

The present invention satisfies the above mentioned needs 
by providing a comprehensive network management system 
that can isolate a root cause of a problem in the network. In 
a first embodiment of the present invention, the root cause 
analysis operates on problem alert signals (PASs) generated 
by monitoring points in the network. One example of a 
problem alert signal is a threshold crossing alert. A threshold 
crossing alert is generated when a monitored performance 
parameter exceeds a predefined threshold. 
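
A threshold crossing alert of the kind described above can be sketched as follows. This is a minimal illustration only; the class name, field names and function name are assumptions made for the example and are not defined in the disclosure.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class ProblemAlertSignal:
    """Hypothetical record for a PAS raised by a monitoring point."""
    monitoring_point: str  # illustrative identifier
    parameter: str         # e.g. "ES" or "SES"
    observed: int          # count seen in the monitoring period
    threshold: int         # predefined threshold that was exceeded


def check_threshold(monitoring_point: str, parameter: str,
                    observed: int, threshold: int) -> Optional[ProblemAlertSignal]:
    """Return a PAS when the monitored count exceeds the threshold."""
    if observed > threshold:
        return ProblemAlertSignal(monitoring_point, parameter, observed, threshold)
    return None  # at or below the threshold: nothing is reported
```

Here a count equal to the threshold does not raise an alert, which follows the word "exceeds" in the text. A deployment could choose a different comparison.
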

In a second embodiment of the present invention, the root 
cause analysis operates on reported error activity. Statistical 
analysis can be used to identify facility operating conditions 
that could lead to a significant network problem. For 
example, the network facility could be operating at a point 
near the tolerance levels. Intermittent errors could therefore 
result due to temporary excursions beyond the tolerance 
thresholds. If a statistical analysis identifies a potential 
problem, a raw performance monitoring PAS is generated. 

In the present invention, root cause analysis seeks to 
identify sources of problems identified by a plurality of 
PASs that are visible to a layer in a network management 
system. Analysis of the plurality of PASs includes various
correlation processes. In one method, path PASs are correlated
to line PASs. In this manner, error activity identified on
a provisioned channel can be isolated to a particular line 
entity of the network. Further, in a second method of root 
cause analysis, line PASs are correlated to section PASs. 
This second correlation process allows error activity iden- 
tified on a line entity to be isolated to a particular section 
entity within the line entity. As part of this general correla- 
tion process, the root cause analysis correlates error activity 
between signal levels in the signal transport hierarchy. In 
this method of analysis, the highest transport level experi- 
encing error activity is identified. In this manner, the net- 
work facilities that are the source of the problem can be 
identified. 

An additional method of root cause analysis also corre- 
lates error activity between path entities. Error activity on 
various path entities may be caused by a common facility 
problem that is not detected by line entities within the path 
entity. Identification of common line entities within the 
various path entities eliminates redundant root cause analy- 
sis processing of the individual path entities. 

The foregoing and other features and advantages of the 
invention will be apparent from the following, more par- 
ticular description of a preferred embodiment of the 
invention, as illustrated in the accompanying drawings. 

BRIEF DESCRIPTION OF THE FIGURES 

In the drawings, like reference numbers indicate identical 
or functionally similar elements. Additionally the left-most 
digit of a reference number identifies the drawing in which 
the reference number first appears. 

FIG. 1 illustrates the layers in a network management 
system.

FIG. 2 illustrates an exemplary circuit topology.

FIGS. 3 and 7 illustrate flow charts of the correlation
process between path and line entities.

FIGS. 4 and 8 illustrate flow charts of the correlation
process between line and section entities.

FIGS. 5A-5D and 9A-9D illustrate flow charts of the
correlation between levels in the digital signal transport
hierarchy.

FIGS. 6 and 10 illustrate flow charts of the correlation
between path entities.

FIG. 11 illustrates a block diagram of a computer useful
for implementing elements of the present invention.

DETAILED DESCRIPTION OF THE
PREFERRED EMBODIMENTS

The operation and administration of a service provider's
network is becoming increasingly complex. Network elements
continue to evolve in support of the provision of a
wider range of services. The overriding goal of network
management is to ensure that all aspects of the network are
operating according to both the service provider's design
and the customer's expectations.

A general open-ended framework is defined by the Inter- 
national Telecommunications Union (ITU) Telecommunica- 
tions Management Network (TMN) standard. The TMN 
standard defines a layered framework for a service provider 
to implement its own network management process. 

FIG. 1 illustrates a network management system 100 that
includes five layers 110, 120, 130, 140 and 150. Layer 150
is designated as the network element layer (NEL). The NEL
is a physical layer that includes the various network elements
(e.g., asynchronous systems, SONET systems, etc.)
used in the transport and routing of network traffic (e.g.,
DS1, DS3, OC-N, etc.). Each network element 151-156 in
NEL 150 can be designed to provide performance
monitoring, alarm and status information to the higher layers
in network management system 100. In particular, network
elements 151-156 are connected to one of the element
managers 141-143 in element management layer (EML)
140. For example, network elements 151 and 152 are
connected to element manager 141. In this manner, each
network element manager 141-143 controls a portion of the
physical network embodied in NEL 150.

Element managers 141-143 can retrieve information from
network elements 151-156 periodically or upon a user
request. Alternatively, network elements 151-156 can be
programmed to provide element managers 141-143 with a
predefined subset of network management information at
predefined time intervals. The domain of an element manager
141-143 can be defined by a vendor's equipment. In
some situations, the domain of an element manager 141-143
is dictated by the geography in which network elements
151-156 reside.

After network management information is acquired by
element managers 141-143 from network elements
151-156, it is forwarded to network management layer
(NML) 130. NML 130 comprises network manager 131.
Network manager 131 is logically shown as a single entity.
In implementation, network manager 131 can comprise one
or more sites. For example, multiple service centers (not
shown) can exist at different parts of the country (e.g., east
coast and west coast). In combination, these national-level
service centers combine to provide total visibility of the
physical network in NEL 150. Network manager 131 can
also be split among services and/or network elements. For
example, in one embodiment, a first network manager is
dedicated to asynchronous parts of the network, a second
network manager is dedicated to DS1, DS3 and VT-n traffic,
and a third network manager is dedicated to STS-n and OC-n
traffic.

Generally, the logical entity identified as network manager
131 is a resource that is accessed by applications in
service management layer (SML) 120. In FIG. 1, SML 120
is shown to include five applications 121-125. Specifically,
SML 120 includes provisioning application 121,
accounting/billing application 122, security application 123,
network performance application 124, and fault management
application 125. This listing of applications is provided without
limitation. Any other application that utilizes network
management data stored within NEL 150 can also be included.
Note that elements of applications 121-125 also reside
within EML 140 and NML 130.

Provisioning application 121 provides a customer interface
for the provisioning of various services. For example,
a customer can indicate a desire for a DS1 digital data
service between network element 151 and network element
155. Upon receipt of this customer request, provisioning
application 121 relays the provisioning commands to network
manager 131. Network manager 131 then communicates
with element managers 141, 143 and any other element
managers that control a part of the end-to-end path to set up
the DS1 connection between network elements 151 and 155.

Applications 122-125 can similarly support a customer
interface by providing access to billing information, security
information, performance information and fault management
information, respectively. Each of these applications
also accesses the resources that are stored within network
manager 131.

Finally, network management system 100 also includes
business management layer (BML) 110. BML 110 includes
logical entity 111. Logical entity 111 represents the general
corporate policy of network management system 100.
Corporate policy 111 dictates the general business and
contractual arrangements of the service provider.

Having identified the various layers in network management
system 100, a system and method for root cause
analysis is now described. Root cause analysis is generally
concerned with the identification of a source of a problem in
the network. Problems in the network may or may not
involve actual system downtime. In other words, problems
in the network may manifest themselves as degradations in
system performance. An example of performance degradation
includes an increase in the bit error rate (BER). Bit
errors are typically measured in terms of errored seconds
(ESs) and severely errored seconds (SESs). An unacceptable
increase in the BER of the provisioned channel may prove
unsatisfactory to the customer. In many instances, customer
expectations of performance of a provisioned channel are
defined by the requirements contained within a service
contract. The service contract may correlate system
performance to the tariffing structure. If the number of ESs, SESs
or unavailability of the service becomes excessive, customer
rebates may be in order.

In a competitive business climate, it is desirable for a
service provider to quickly identify and repair problems
leading to system downtime or degradation. If a root cause
of a problem cannot be pinpointed, it will continue to affect
system performance and a customer's perceptions. The
present invention provides a system and method for
identifying the root cause of network problems using a
comprehensive approach. This comprehensive approach analyzes
the information stored in one or more layers of network
management system 100.

In a first embodiment, root cause analysis is operative on
error activity that is reported from monitoring points that are
associated with network elements 151-156. Monitoring
points are described in greater detail in related applications
entitled "System and Method for Monitoring Point
Identification," Ser. No. 08/672,512, filed Jun. 28, 1996;
"System and Method for End-to-End Threshold Setting,"
Ser. No. 08/670,845, filed Jun. 28, 1996; and "System and
Method for Monitoring Point Activation," Ser. No.
08/672,356, filed Jun. 28, 1996.

Generally, monitoring points are labeled as either primary
monitoring points or secondary monitoring points. A basic
monitoring point strategy for a generic end-to-end path
includes the placement of monitoring points nearest to the
facility end points. Monitoring points nearest to the facility
end points are designated as primary monitoring points
(PMPs). A facility end point can be thought of as a generic
customer termination point (e.g., handoff point to a business,
local exchange carrier, etc.). In the context of FIG. 2, PMPs
can be associated with source 202 and destination 204.

In addition to the PMPs, a provisioned channel may also
include secondary monitoring points (SMPs). SMPs are
intermediate performance monitoring data collection points.
SMPs allow network management system 100 to isolate a
network problem by providing performance monitoring
information at intermediate sections of the end-to-end path.

Over a period of time, a monitoring point associated with
a network element may observe an excessive number of ESs
on a received channel. If the monitoring point determines
that the error activity is significant, a problem alert signal
(PAS) is reported to one of element managers 141-143. Note
that any type of statistical analysis can be used by the
monitoring point to determine whether to report a PAS to an
element manager 141-143. For example, the monitoring
point could determine whether a number of ESs exceeds a
predefined threshold. In the remainder of the description,
PASs are used to describe the general class of reported error
activity.

Note that a monitoring point can generate a PAS based
upon error activity measured at section, line, and path
terminating points. FIG. 2 illustrates section, line and path
entities in an exemplary channel (e.g., DS1, DS3, VT-n,
STS-1, STS-3c, STS-12c, etc.) provisioned between source
202 and destination 204. In this example, the channel
originates at source 202, traverses multiplexer 210,
regenerators 220 and 230, and cross-connect 240, and finally
terminates at destination 204. Each network element 202,
204, 210, 220, 230 and 240 inserts and extracts overhead
information. In the SONET context, section and line
overhead information is contained within the transport overhead
portion of a synchronous transport signal (STS) frame (not
shown). Path overhead information, on the other hand, is
contained within the synchronous payload envelope (SPE)
information payload. Note that the terms section, line and
path are used without limitation. As would be apparent to
one of ordinary skill in the relevant art, the root cause
analysis described below could also be extended to other
network transmission standards having analogous network
sectionalization.

In the following description, section, line and path entities
are used to refer to the portions of the network that insert and
extract section, line and path overhead bytes, respectively.
For example, the OC-N link between regenerators 220 and
230 defines a section entity. In the transmission from
regenerator 220 to regenerator 230, regenerator 220 inserts the
section overhead bytes into the STS-N frame, performs an
electrical-to-optical conversion of the STS-N signal, and
transmits the OC-N signal to regenerator 230. Regenerator
230 then receives the OC-N signal, performs an
optical-to-electrical conversion and extracts the section overhead
information. Regenerator 230 can use the extracted section
overhead information to determine the error performance of
the transmission from regenerator 220. For example, errors
may have been created by a problem with the optical fiber,
a problem with a fiber-optic connector, etc. A monitoring
point associated with regenerator 230 tracks this error
performance over a monitoring period and reports the
accumulated results to an element manager 141-143. Note that
regenerators 220 and 230 extract only the section overhead
information. For this reason, the span between regenerators
220 and 230 defines section entity 252, not a line or path
entity.

Next, consider the span between multiplexer 210 and
cross-connect 240. In this span, multiplexer 210 and
cross-connect 240 insert and extract both line and section
overhead information. Note that only one direction of the
two-way communication is illustrated. As shown, the insertion
and extraction of line overhead information defines line
entity 257. Line entity 257 includes section entities
252-254. Finally, consider source 202 and destination 204.
These elements insert and extract path overhead
information, thereby defining path entity 259. Path entity
259 includes section entities 251-255 and line entities
256-258.
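
The way section, line and path entities follow from the overhead that each element of FIG. 2 terminates can be sketched as below. This is an illustrative sketch only; the numeric layer encoding, the CHANNEL list and the entities function are assumptions made for the example.

```python
# Highest overhead layer terminated by each element (encoding is assumed).
SECTION, LINE, PATH = 1, 2, 3

# Elements of the FIG. 2 channel in transmission order.
CHANNEL = [
    (202, PATH),     # source
    (210, LINE),     # multiplexer
    (220, SECTION),  # regenerator
    (230, SECTION),  # regenerator
    (240, LINE),     # cross-connect
    (204, PATH),     # destination
]


def entities(channel, layer):
    """Return (start, end) spans between consecutive elements that
    insert and extract overhead at `layer` or at a higher layer."""
    terminators = [ref for ref, top in channel if top >= layer]
    return list(zip(terminators, terminators[1:]))
```

With this encoding the channel yields five section spans, three line spans and one path span, matching the counts of section entities 251-255, line entities 256-258 and path entity 259.
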

In considering the network as a whole, each channel
provisioned over a plurality of network elements defines its
own section, line and path entities. Monitoring points
associated with these network elements can monitor error
activity for section, line or path entities. Based upon the
monitored error activity, a monitoring point can determine
whether a PAS should be generated. This PAS is sent to an
element manager 141-143. The PAS can also be forwarded
to higher layers in network management system 100.

During a single monitoring period, monitoring points for 
the various provisioned channels in the network can report 
section, line and path PASs. Each layer in network manage- 
ment system 100 can analyze the reported PASs to determine 
whether one or more root causes exist. The root cause
analysis can simultaneously utilize one or more of the 
following methods of PAS analysis. 

A first method of analysis is based upon the relationship
between path entities and line entities. FIG. 3 illustrates a
flow chart of the path and line entity correlation process. The
process begins at step 302 where a layer in network management
system 100 receives a path PAS. Assume that a
monitoring point associated with destination 204 has
reported a path PAS to element manager 143. Element
manager 143 then determines, at step 304, whether it has
enough information to identify the root cause of the network
problem.

Recall that each element manager 141-143 has access to
network management information received from a subset of
network elements 151-156. If the reported PAS was caused
by a problem outside of an element manager's domain, then
that particular element manager cannot isolate the problem.
Various sequences of decisions are used by each element
manager 141-143 to determine if the problem can be
isolated.

For example, consider the reported performance parameter
of severely errored seconds (SESs). SESs can be classified
into two types, line SESs (SESL) and path SESs
(SESP). If a PAS is generated based upon an excessive 
number of SESLs, then the element manager knows that the 
problem is local. Assuming that both ends of the line reside 
in the element manager's domain, the element manager can 
conclude that the network problem can be isolated. 

If a PAS is triggered based upon an excessive number of 
SESPs, then the element manager knows that the network 
problem may not necessarily reside in the element manag- 
er's domain. An additional determination must be made as 
to whether the originating and terminating network elements 
reside in the element manager's domain. If both reside in the 
element manager's domain, then the element manager 
knows that it can isolate the network problem. 
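
The SESL and SESP decisions just described can be sketched as a single check. This is a simplified illustration under stated assumptions: the function name, the string labels for the PAS type and the representation of a domain as a set of element numbers are all hypothetical.

```python
def can_isolate(pas_type, endpoints, domain):
    """Decide whether an element manager can isolate the problem.

    pas_type  -- "SESL" (line) or "SESP" (path); labels are assumed
    endpoints -- the two network elements that terminate the line or path
    domain    -- set of network elements the element manager controls
    """
    if pas_type not in ("SESL", "SESP"):
        raise ValueError(f"unsupported PAS type: {pas_type}")
    # In both cases the text requires both ends to lie in the domain.
    return all(element in domain for element in endpoints)
```

Both PAS types reduce to the same membership test here. The difference in the text is which endpoints are examined: the ends of the line for SESL, and the originating and terminating elements of the path for SESP.
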

As would be apparent to one of ordinary skill in the 
relevant art, implementation dependent decision trees can be 
designed to determine whether a root cause of a network 
problem can be isolated based upon a specific type of PAS. 
If an element manager 141-143 determines that it does not 
have enough information to identify the root cause of the 
network problem, then the root cause analysis must be 
performed by the next highest layer in network management 
system 100 (i.e., NML 130). This is illustrated by step 306. 
Note that network manager 131 of NML 130 begins the 
isolation process as soon as a PAS is received from one of 
element managers 141-143. 
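
The escalation from EML 140 to NML 130 can be sketched as follows. Only the assignment of network elements 151 and 152 to element manager 141 comes from the text; the other domain assignments, the function name and the returned labels are assumptions made for the example.

```python
# Domains of the element managers. Only the entry for 141 is stated in
# the text; the entries for 142 and 143 are hypothetical.
ELEMENT_MANAGER_DOMAINS = {
    141: {151, 152},
    142: {153, 154},
    143: {155, 156},
}


def analysis_layer(reporting_manager, involved_elements):
    """Return the layer that performs root cause analysis for a PAS.

    The reporting element manager keeps the analysis when every involved
    network element lies in its domain; otherwise the analysis is
    escalated to the network management layer (step 306).
    """
    domain = ELEMENT_MANAGER_DOMAINS[reporting_manager]
    if set(involved_elements) <= domain:
        return f"EML:{reporting_manager}"
    return "NML"
```

For instance, a path PAS reported to element manager 143 for a channel between network elements 151 and 155 would be escalated, since element 151 is outside the assumed domain of 143.
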

Once control passes to NML 130, network manager 131 
similarly determines whether it has enough information to 
isolate the problem. As noted above, in one embodiment 
multiple network managers are used. In this case, root cause 
analysis is performed by an application in SML 120 using 
the resources contained in the multiple network managers. 

In the following description, assume that element man- 
ager 143 determines, at step 304, that it has enough infor- 
mation to identify the root cause. Next, element manager 
143 determines, at step 308, whether a distinct line entity 
exists for the path entity. In other words, element manager 
143 determines whether the path entity includes more than 
one line entity. If a path entity includes only one line entity, 
then the line and path overhead information inserted at 
source 202 is extracted at the same point. In the context of 
FIG. 2, this scenario would arise if multiplexer 210, regen- 
erators 220 and 230, and cross-connect 240 did not exist. 

If element manager 143 determines, at step 308, that the 
path entity does not include distinct line entities, then the 
path-line correlation process of FIG. 3 ends. In this case, the 
root cause analysis cannot be further narrowed to a part of 
the path entity. However, if element manager 143 
determines, at step 308, that the path entity does include 
distinct line entities, then element manager 143 next 
determines, at step 310, whether a line entity within the path 
entity has reported a corresponding PAS. 

If none of the line entities within the path entity has 
reported a corresponding PAS, then the path-line correlation 
process ends. This results since the reported PASs within the
current monitoring period do not contain enough line
entity information to further identify the root cause of the 
path PAS. In one example, the root cause of the path PAS 
may not have been identified by monitoring points associ- 
ated with the line entities. In another example, the monitor- 
ing points associated with the line entities may not have been 
activated. Subsequent activation of the monitoring points 
will allow further root cause analysis in a later monitoring 
period. 

Finally, if element manager 143 determines, at step 310, 
that a line entity has reported a corresponding PAS, then the
root cause of the network problem is narrowed to a particular
line entity. Additional root cause analysis then proceeds at 
step 312 to further identify the root cause of the problem 
within the identified line entity. For example, the root cause 
could reside in a section entity within the line entity. Note 
that more than one line entity within a path entity can report 
a corresponding PAS. For example, errors within line enti- 
ties 256 and 257 of FIG. 2 could both be sufficient to trigger 
the reporting of a PAS. 
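
Steps 308 and 310 of the path-line correlation process can be sketched as follows. This is a minimal sketch, assuming that line entities are identified by their reference numbers and that the line PASs received in the current monitoring period are available as a set; the function name is hypothetical.

```python
def correlate_path_to_line(line_entities, line_pas_entities):
    """Narrow a path PAS to the line entities that reported a PAS.

    line_entities     -- line entities contained in the path entity
    line_pas_entities -- line entities that reported a corresponding PAS
                         in the current monitoring period
    Returns the suspect line entities, or an empty list when the root
    cause cannot be narrowed below the path entity.
    """
    # Step 308: without distinct line entities there is nothing to narrow.
    if len(line_entities) <= 1:
        return []
    # Step 310: keep every line entity that reported a corresponding PAS.
    return [entity for entity in line_entities if entity in line_pas_entities]
```

A non-empty result corresponds to continuing at step 312. The result can hold more than one entity, as in the example of line entities 256 and 257 both reporting a PAS.
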

FIG. 4 illustrates a second method of root cause analysis 
that is based upon the correlation of PASs reported by line 
entities and section entities. The line-section correlation 
process begins at step 402 where a layer in network man- 
agement system 100 receives a line PAS. In the following 
discussion, assume that a monitoring point associated with 
cross-connect 240 has reported a line PAS to element 
manager 143. Element manager 143 then determines, at step 
404, whether it has enough information to identify the root 
cause of the network problem. For example, if a line PAS is 
generated by a monitoring point associated with cross- 
connect 240 of line entity 257, element manager 143 may 
determine that multiplexer 210 and regenerator 220 are in 
another element manager's domain. In this case, root cause 
analysis of the line PAS is performed at a higher layer in 
network management system 100 that has access to infor- 
mation for line entity 257. 

If element manager 143 determines, at step 404, that it has 
enough information to identify the root cause, then element 
manager 143 determines, at step 408, whether a distinct 
section entity exists for the line entity. In the example of line 
entity 257, distinct section entities 252-254 would exist. 

If element manager 143 determines, at step 408, that the 
line entity does not include distinct section entities, then the 
line-section correlation process of FIG. 4 ends. In this case, 
the root cause analysis cannot be further narrowed to a part 
of the line entity. However, if element manager 143 
determines, at step 408, that the line entity does include 
distinct section entities, then element manager 143 next 
determines, at step 410, whether a section entity within the 
line entity has reported a corresponding PAS. 

If none of the section entities within the line entity has 
reported a corresponding PAS, then the line-section
correlation process ends. This results since the reported PASs
within the current monitoring period do not contain
enough section entity information to further identify the root
cause of the line PAS. In a similar manner to the path-line
correlation process described above, this could result if
the monitoring points associated with the section entities
have not been activated.

Finally, if element manager 143 determines, at step 410, 
that a section entity has reported a corresponding PAS, then 
the root cause of the network problem is narrowed to the 
particular section entity. Since the section entity is the lowest 
level of granularity, the root cause analysis is complete. 
Thus, at step 412, the section entity is identified as the root 
cause of the network problem. Again, note that more than 
one section entity within a line entity can report a PAS. 
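
The line-section correlation of FIG. 4 can be sketched in a few lines of Python. This is only an illustrative sketch: the `LineEntity` type, the function name, and the use of plain sets for topology knowledge and reported PASs are assumptions made here, not structures named in the description.

```python
from dataclasses import dataclass, field
from typing import List, Optional, Set


@dataclass
class LineEntity:
    """Hypothetical model of a line entity and the section entities it spans."""
    name: str
    sections: List[str] = field(default_factory=list)


def correlate_line_to_sections(
    line: LineEntity,
    known_entities: Set[str],
    section_pas: Set[str],
) -> Optional[List[str]]:
    """Return the section entities identified as root cause.

    None means the analysis must be escalated to a higher layer;
    an empty list means the root cause cannot be narrowed further.
    """
    # Step 404: does this layer have information for the whole line entity?
    needed = [line.name] + line.sections
    if any(entity not in known_entities for entity in needed):
        return None
    # Step 408: without distinct section entities the process ends.
    if not line.sections:
        return []
    # Steps 410 and 412: every section entity that reported a corresponding
    # PAS is identified as a root cause (more than one may report).
    return [s for s in line.sections if s in section_pas]
```

For line entity 257 with section entities 252-254, a PAS from section entity 253 alone would yield `["253"]`, while missing topology for any of the entities would yield `None` and defer the analysis to a higher layer.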

As described above, a function of the root cause analysis 
is to correlate PASs. Path PASs can be correlated with line 
PASs and line PASs can be further correlated to section 
PASs. As part of this general correlation process a layer in 
network management system 100 correlates PASs between 
various signal levels in the transport hierarchy. Generally, 
when error activity is detected at a particular digital trans- 
port level, the correlation of that error activity to other error 
activity within the digital transport level hierarchy is not a
simple matter. For example, error activity detected at the
DS3 level may not be detected at the STS-48 level. This can
occur because error detection at the higher STS-48 level has
insufficient granularity. An error at the STS-48 level may
affect some or all of the lower transport levels within the
STS-48 signal depending on the severity and the distribution
of the error event. For example, bursty errors may affect only
some of the lower transport levels while a continuous error
may affect all lower transport levels. For this reason,
performance monitoring data is collected at different
hierarchical signal transport levels and error activity is
correlated between the different signal transport levels.
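
The difference between bursty and continuous errors can be illustrated with a deliberately simplified model. The sketch below assumes that byte i of the aggregate signal belongs to tributary i mod 48 (pure byte interleaving) and ignores overhead bytes entirely; the function name and parameters are hypothetical.

```python
def affected_tributaries(burst_start: int, burst_len: int,
                         n_tributaries: int = 48) -> set:
    """Tributary indices hit by a run of consecutive corrupted bytes.

    Simplified model: byte i of the aggregate belongs to tributary
    i % n_tributaries, and overhead bytes are ignored.
    """
    # Any run of n_tributaries consecutive bytes already touches every
    # tributary, so longer runs need not be scanned further.
    span = min(burst_len, n_tributaries)
    return {(burst_start + i) % n_tributaries for i in range(span)}
```

Under these assumptions a 5-byte burst touches only 5 of the 48 tributaries, whereas any error run of 48 bytes or more touches all of them, which is why path-level activity on a few channels need not coincide with activity on every channel of the same line.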

In this correlation process, the highest rate in the transport
signal hierarchy experiencing simultaneous error activity is
identified. FIGS. 5A-5D illustrate this correlation process of
identifying the highest signal transport level that experiences
simultaneous error activity. In the following description,
assume that element manager 141 has received a DS1 path
PAS from a network element in its domain.

First, at step 502, element manager 141 determines
whether the DS1 channel is mapped into a DS3 channel or
into a VT-1.5 channel. If the DS1 channel is mapped into a
DS3 channel, the process continues at step 506. At step 506,
element manager 141 determines whether the DS3 in which
the DS1 is mapped has reported a PAS. In other words,
element manager 141 determines whether the DS3 in which
the DS1 is mapped has simultaneous error activity
occurring.

In a similar manner to step 506, step 508 is invoked if
element manager 141 determines, at step 502, that the DS1
is mapped into a VT-1.5 channel. At step 508, element
manager 141 determines whether the VT-1.5 in which the
DS1 is mapped has reported a PAS. Again, element manager
141 determines whether the VT-1.5 has simultaneous error
activity occurring.

If it is determined at either step 504 or step 506 that the
VT-1.5 or DS3, respectively, has not reported a PAS, then
element manager 141 reports the DS1 path PAS to network
manager 131 in NML 130. Next, at step 510, the highest
transport level is identified as a DS1 and the process ends.

Returning to step 506, if the DS3 in which the DS1 is
mapped does report a PAS, the process continues to step 512
of FIG. 5B. At step 512, element manager 141 determines
whether the DS3 is mapped into an STS-1. If the DS3 is not
mapped into an STS-1, the DS3 PAS is reported to network
manager 131 at step 516. Thereafter, the highest transport
level is identified as a DS3. If element manager 141
determines, at step 514, that the DS3 is mapped into an
STS-1, element manager 141 then determines whether the
STS-1 has reported a PAS. If an STS-1 PAS was not reported,
the DS3 PAS is reported to network manager 131.

Returning to step 504, if the VT-1.5 in which the DS1 is
mapped does report a PAS, the process continues to step 520
of FIG. 5C. At step 520, element manager 141 determines
whether the STS-1 in which the VT-1.5 is mapped has
reported a PAS. Here, no determination is made as to
whether the VT-1.5 is mapped into an STS-1, because a
VT-1.5 channel cannot exist in the network independently of
an STS-1 channel. If the associated STS-1 in which the
VT-1.5 is mapped has not reported a PAS, the VT-1.5 PAS is
reported to network manager 131 at step 522. Next, the
highest transport level is identified as a VT-1.5 channel at
step 524.
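
The escalation described for FIGS. 5A-5C amounts to walking up the mapping hierarchy for as long as the next higher level also reports a PAS. The following Python sketch shows that walk under stated assumptions: the dictionary `mapped_into`, the set `reported_pas`, and the function name are hypothetical, and the reporting of the PAS to the network manager is left out.

```python
from typing import Dict, Optional, Set


def highest_level_with_errors(
    channel: str,
    mapped_into: Dict[str, Optional[str]],
    reported_pas: Set[str],
) -> str:
    """Return the highest transport level with simultaneous error activity.

    mapped_into gives the next higher channel carrying each channel, or
    None (or no entry) when the channel is not mapped any higher. The
    mapping is assumed to be acyclic.
    """
    current = channel
    while True:
        parent = mapped_into.get(current)
        # Stop when there is no higher level or it has not reported a PAS;
        # the PAS of the current level is the one that would be forwarded.
        if parent is None or parent not in reported_pas:
            return current
        current = parent
```

With a DS1 mapped into a DS3 that is mapped into an STS-1, PASs on the DS1 and DS3 only would identify the DS3 as the highest level, and an additional STS-1 PAS would move the result up to the STS-1.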

If element manager 141 determines at either step 514 or
step 520 that an STS-1 has reported a PAS, the process
continues at step 526 of FIG. 5D. At step 526, element
manager 141 determines whether the STS-1 is mapped into
an OC-48 system or an OC-3/12 system. An OC-48 system
is generally used to transport STS-1s between two nodes in
the network. An OC-3 or OC-12 system, on the other hand,
is generally used to transport STS-1s between network
elements within a particular node. For example, an OC-3 or
OC-12 fiber optic link could be used to transport STS-1s
between an OC-48 line terminating equipment (LTE) and a
broadband digital cross-connect system (BBDCS).

If element manager 141 determines at step 526 that the
STS-1 is mapped into an OC-3/12 system, element manager
141 then determines, at step 528, whether the OC-3/12
system has reported a PAS. If the OC-3/12 system has
reported a PAS, the highest transport level is identified as an
OC-3/12 at step 532. Conversely, if the OC-3/12 system has
not reported a PAS, an STS-1 PAS is reported to network
manager 131 at step 534. Next, the highest transport level is
identified as an STS-1 at step 536.

If element manager 141 determines at step 526 that the
STS-1 is mapped into an OC-48 system, element manager
141 then determines, at step 530, whether the OC-48 system
has reported a PAS. If the OC-48 system has reported a PAS,
the highest transport level is identified as an OC-48 at step
538. Conversely, if the OC-48 system has not reported a
PAS, an STS-1 PAS is reported to network manager 131 at
step 534. In this case, the highest transport level is identified
as an STS-1 at step 536.

As the flow chart of FIGS. 5A-5D illustrates, this aspect of
root cause analysis seeks to identify the highest transport
level that is experiencing simultaneous error activity. Note
that while all non-zero error activity is reported to the higher
layers in network management system 100, only the highest
level path PAS is forwarded to NML 130. This reduces the
amount of trouble isolation processing that occurs at NML
130. Generally, the identification of the highest level path
PAS allows network management system 100 to identify the
true source of the network error activity. Note also that the
identification process of FIGS. 5A-5D need not begin at the
DS1 level. For example, the identification process could
begin at step 512 after a DS3 PAS is reported.

Having described the general correlation process between
signal transport levels, a simple example is provided with
reference to FIG. 2. In this example, assume that an STS-1
channel is provisioned between source 202 and destination
204. Assume further that PMPs are associated with source
202 and destination 204 and SMPs are associated with
multiplexer 210, cross-connect 240, and regenerators 220
and 230. Both the PMPs and SMPs are activated. If errors
are generated in any of network elements 202, 210, 220, 230,
240 in the end-to-end path, an STS-1 path PAS may be
generated by a PMP associated with destination 204. After
receipt of an STS-1 path PAS, the correlation process
continues at step 526 of FIG. 5D. At step 526, a layer in network
management system 100 determines whether the STS-1 is
mapped into an OC-48 or OC-3/12 system. This
determination is based upon a retrieved topology of the provisioned
channel. In this example, the STS-1 is mapped into the
OC-N system that defines line entity 257. Assuming that the
OC-N system is an OC-48 system, the correlation process
proceeds to step 530 where it is determined whether the
OC-48 system has reported a PAS.

The determination at step 530 can be answered in the
affirmative if the SMP that is associated with cross-connect
240 reports a line PAS for the STS-48 signal. If this is the
case, the STS-1 path PAS is correlated to the OC-48/STS-48
line PAS. At this point, the root cause analysis has narrowed
the potential source of the error activity to line entity 257.
Further identification of the root cause of the problem is
provided by an additional correlation between PASs for line
entity 257 and PASs for section entities 252-254.

Having described a correlation process between path, line
and section PASs along the circuit topology of a single
provisioned channel, a correlation process between
independent path PASs is now provided. As described above, each
path PAS that is received by a layer in network management
system 100 could require independent root cause analysis
consideration. Root cause analysis would generally seek to
narrow the consideration from a path level to a line or
section level using the correlation processes outlined above.
Note, however, that error activity detected for a path
entity may not necessarily be identified at line or section
entities. This results from the error monitoring process for
line and section entities. Consider the span defined by
section entity 251 and line entity 256 of FIG. 2. In this
example, the line and section overhead bytes are inserted
into the transport overhead part of an STS-1 frame at source
202. These line and section overhead bytes are extracted by
multiplexer 210. Included within the line and section
overhead bytes is a bit interleaved parity byte (BIP-8) that
provides even parity over the previous STS-1 frame. The
BIP-8 is determined by source 202 after the previous STS-1
frame is scrambled at source 202 prior to an
electrical-to-optical conversion to the OC-1 signal. Upon receipt of the
BIP-8, multiplexer 210 can determine whether a received
scrambled STS-1 contains any errors.

After the error calculation and subsequent unscrambling,
the STS-1 is provided as one of the low-speed inputs to
multiplexer 210. If multiplexer 210 is an OC-48 system, 48
STS-1s are byte-interleaved multiplexed into an STS-48
signal. After scrambling, the STS-48 signal is converted into
an OC-48 signal for transmission to regenerator 252. In this
process, note that errors can be generated internally by
multiplexer 210. For example, a multiplexing card (not
shown) within multiplexer 210 could introduce errors into
the SPE of one of the 48 STS-1 signals. Since these errors
were introduced prior to the scrambling of the STS-48
signal, the errors would not be detected by the line and
section entity error calculations. These errors will be
detected by the path entity error detection process.
Specifically, the PMP associated with destination 204 can
monitor the errors and generate a path PAS.

As noted above, multiplexer 210 can multiplex 48 STS-1s
into the OC-48 signal. Each of these 48 STS-1s could
traverse independent paths throughout the network and
terminate on independent destinations. If multiplexer 210
generates errors on each of the 48 STS-1s, then potentially
48 separate STS-1 path PASs could be generated.
Correlation between these path PASs could quickly identify
multiplexer 210 as the root cause of all the error activity.

FIG. 6 illustrates the path PAS correlation process. This
process begins at step 602 where a layer in network
management system 100 receives a plurality of path PASs. Next,
at step 604, the layer determines whether it has entire
information for all of the paths that have reported a PAS. If
the layer determines at step 606 that it does not have enough
information, the root cause analysis must be performed by
one of the higher layers in network management system 100.
This is illustrated by step 606. If the layer determines at step
606 that it does have enough information, then the process
continues to step 608. At step 608, the layer determines
whether a common line entity exists. If the layer determines
at step 608 that a common line entity does not exist then the
process ends. Conversely, if the layer determines at step 608
that a common line entity does exist, a possible root cause
of the problem is identified at step 610.

As described above, this possible root cause may
represent a convergence point of paths that generate path PASs.
By identifying a convergence point, the layer bypasses
redundant processing of the individual path PASs. Speed and
efficiency of error detection and correction is thereby
improved.

In a second embodiment of the present invention, layers
within network management system 100 use raw
performance monitoring (PM) data to identify root causes of
problems in the network. One goal of raw PM data analysis
is to identify network problems at the earliest possible stage.
In other words, it is desirable for a service provider to
identify a potential network problem before any significant
effects are felt by the customer. In this manner, the service
provider is able to correct a problem before the customer is
aware that a problem exists.

One example of a potential problem is the existence of
"dribble" errors. In this context, dribble errors are used to
refer to a situation where a system is operating satisfactorily
but not error-free. Errors that are reported by the monitoring
points to the layers in network management system 100 are
typically not large enough to cause a monitoring point to
declare a PAS. Ordinarily, these non-zero error reports
would not prompt any action by a service provider.
However, these non-zero errors could indicate that a
network element is operating at a point near the acceptable
tolerance levels. Numerous examples exist. Intermittent
errors could simply be caused by a dirty connector in a
fiber-optic link. In other cases, synchronization shifts could
cause jitter tolerance levels to be exceeded. In other
examples, temperature or humidity variations could cause
network element performance to periodically degrade.

Regardless of the cause, intermittent non-zero error
reports will be provided to the layers in network
management system 100. Each layer in network management
system 100 can independently analyze the existence of
non-zero error activity over a period of time. Experience in the
analysis of the non-zero error activity can lead to a
correlation between specific patterns of error activity with the
existence of specific network problems. Any means of
statistical analysis can be used as a means for triggering a
root cause analysis process. For example, if specific patterns
of error activity are known to lead to certain failures, general
pattern recognition systems (e.g., neural networks) can be
used for triggering purposes. As noted above, this statistical
analysis can be performed at each layer of network
management system 100 simultaneously. The only difference in
processing is the scope of PM data that is available to an
element in the particular layer in network management
system 100.

FIG. 7 illustrates a flow chart of the path and line entity
correlation process in the second embodiment. The process
begins at step 702 upon the generation of a raw PM path
PAS. A raw PM path PAS is generated by one of the layers
in network management system 100 upon an analysis of the
reported raw PM path data. As noted above, analysis of
patterns of error activity over time could cause a layer in
network management system 100 to identify a potential
network problem. Note that a raw PM path PAS is generated
by a layer in network management system 100 while a
regular path PAS is generated by a monitoring point
associated with a network element. As further illustrated at step
702 of FIG. 7, a regular path PAS could also be used in the
unreported root cause analysis.

After a raw PM path PAS is generated or a regular path
PAS is received by a layer in network management system,
the layer determines whether it has enough information to
determine the root cause. This process is represented by the
loop of steps 704 and 706. After a layer in network
management system determines at step 706 that it has enough
information to determine the root cause, the process
continues to step 708.

If a layer determines, at step 708, that the path entity does
not include distinct line entities, then the path-line
correlation process of FIG. 7 ends. In this case, the unreported root
cause analysis cannot be further narrowed to a part of the
path entity. However, if the layer determines, at step 708,
that the path entity does include distinct line entities, then
the layer next determines, at step 710, whether a line entity
within the path entity has reported non-zero error activity.

If none of the line entities within the path entity has
reported non-zero error activity, then the path-line
correlation process ends. As noted above, this situation may result
if the monitoring points associated with the line entities have
not been activated. Subsequent activation of the
monitoring points will allow further unreported root cause
analysis in a later monitoring period.

Finally, if the layer determines, at step 710, that a line
entity has reported non-zero error activity, then the root
cause of the network problem is narrowed to a particular line
entity. Additional root cause analysis then proceeds at step
712 to further identify the root cause of the problem within
the identified line entity.

FIG. 8 illustrates a second method of unreported root
cause analysis that is based upon the correlation of error
activity in line entities and section entities. The line-section
correlation process begins at step 802 where a layer in
network management system 100 generates a raw PM line
PAS or receives a regular PAS from a monitoring point.
After a raw PM line PAS is generated or a regular path PAS
is received by a layer in network management system, the
layer determines whether it has enough information to
determine the root cause. This process is represented by the
loop of steps 804 and 806. After a layer in network
management system determines at step 806 that it has enough
information to determine the root cause, the process
continues to step 808.

At step 808, the layer determines whether a distinct
section entity exists for the line entity. If element manager
143 determines, at step 808, that the line entity does not
include distinct section entities, then the line-section
correlation process of FIG. 8 ends. In this case, the unreported
root cause analysis cannot be further narrowed to a part of
the line entity. However, if the layer determines, at step 808,
that the line entity does include distinct section entities, then
the layer next determines, at step 810, whether a section
entity within the line entity has reported non-zero error
activity.

If none of the section entities within the line entity has
reported non-zero error activity, then the line-section
correlation process ends. Finally, if the layer determines, at step
810, that a section entity has reported non-zero error activity,
then the root cause of the network problem is narrowed to
the particular section entity. Since the section entity is the
lowest level of granularity, the unreported root cause
analysis is complete. Thus, at step 812, the section entity is
identified as the root cause of the network problem.

As described above, a function of the unreported root
cause analysis is to correlate non-zero error activity. Raw
PM path PASs can be correlated with non-zero error
activity on a line entity and raw PM line PASs can be further
correlated to non-zero error activity on a section entity. In a
similar manner to the reported root cause analysis, the
unreported root cause analysis correlates non-zero error
activity between various signal levels in the transport
hierarchy. In this correlation process, the highest rate in the
transport signal hierarchy experiencing simultaneous error
activity is identified. FIGS. 9A-9D illustrate this correlation
process. In the following description, assume that a layer in
network management system 100 has either generated a raw
PM DS1 path PAS or received a regular DS1 PAS from a
monitoring point.

The transport level determination process begins at step
902 of FIG. 9A. At step 902, the layer determines whether
the DS1 channel is mapped into a DS3 channel or into a
VT-1.5 channel. If the DS1 channel is mapped into a DS3
channel, the process continues at step 906. At step 906, the
layer determines whether the DS3 in which the DS1 is
mapped has reported non-zero error activity. In a similar
manner to step 906, step 908 is invoked if the layer
determines, at step 902, that the DS1 is mapped into a
VT-1.5 channel. At step 908, the layer determines whether
the VT-1.5 in which the DS1 is mapped has reported
non-zero error activity. In the simplest example for either
scenario, the layer determines whether non-zero error
activity has been recorded for the same monitored parameter
identified in the original statistical analysis.

If it is determined at either step 904 or step 906 that the
VT-1.5 or DS3, respectively, has not reported
non-zero error activity, then the layer, at step 908, reports the
DS1 raw PM PAS to the next highest layer in network
management system 100. Next, at step 910, the highest
transport level is identified as a DS1 and the process ends.

Returning to step 906, if the DS3 in which the DS1 is
mapped does report non-zero error activity, the process
continues to step 912 of FIG. 9B. At step 912, the layer
determines whether the DS3 is mapped into an STS-1. If the
DS3 is not mapped into an STS-1, the DS3 raw PM PAS is
reported to the next highest layer at step 916. Thereafter, the
highest transport level is identified as a DS3 at step 918. If
the layer determines, at step 914, that the DS3 is mapped
into an STS-1, the layer then determines whether the STS-1
has reported non-zero error activity at step 914. If non-zero
error activity was not reported, the DS3 raw PM PAS is
reported to the next highest layer.

Returning to step 904, if the VT-1.5 in which the DS1 is
mapped does report non-zero error activity, the process
continues to step 920 of FIG. 9C. At step 920, the layer
determines whether the STS-1 in which the VT-1.5 is
mapped has reported non-zero error activity. If the
associated STS-1 in which the VT-1.5 is mapped has not reported
non-zero error activity, the VT-1.5 raw PM PAS is reported
to the next highest layer at step 922. Next, the highest
transport level is identified as a VT-1.5 channel at step 924.

If the layer determines at either step 914 or step 920 that
an STS-1 has reported non-zero error activity, the process
continues at step 926 of FIG. 9D. At step 926, the layer
determines whether the STS-1 is mapped into an OC-48
system or an OC-3/12 system. If the layer determines at step
926 that the STS-1 is mapped into an OC-3/12 system, the
layer then determines, at step 928, whether the OC-3/12
system has reported non-zero error activity. If the OC-3/12
system has reported non-zero error activity, the highest
transport level is identified as an OC-3/12 at step 932.
Conversely, if the OC-3/12 system has not reported non-zero
error activity, an STS-1 raw PM PAS is reported to the next
highest layer at step 934. The highest transport level is then
identified as an STS-1 at step 936.

If the layer determines at step 926 that the STS-1 is
mapped into an OC-48 system, the layer then determines, at
step 930, whether the OC-48 system has reported non-zero
error activity. If the OC-48 system has reported non-zero
error activity, the highest transport level is identified as an
OC-48 at step 938. Conversely, if the OC-48 system has not
reported non-zero error activity, an STS-1 raw PM PAS is
reported to the next highest layer at step 934. In this case, the
highest transport level is identified as an STS-1 at step 936.

As the flow chart of FIGS. 9A-9D illustrates, the present
invention seeks to identify the highest transport level that is
experiencing simultaneous error activity. Note again that the
identification process of FIGS. 9A-9D need not begin at
the DS1 level. For example, the statistical analysis could
identify a potential problem at the DS3 level. In this case, the
process described in FIGS. 9A-9D begins at step 912.

FIG. 10 illustrates the process of correlating path PASs.
This process begins at step 1002 where a layer in network
management system 100 identifies a plurality of path PASs.
These path PASs include both raw PM path PASs and
regular PASs. The raw PM path PASs may be generated by
one of the layers within network management system 100.

After the plurality of path PASs are identified, the layer
determines at step 1004 whether it has entire information for
all of the paths that have reported a PAS. If the layer
determines at step 1004 that it does not have enough
information, the root cause analysis must be performed by
one of the higher layers in network management system 100.
This is illustrated by step 1008. If the layer determines at
step 1006 that it does have enough information, then the
process continues to step 1008. At step 1008, the layer
determines whether a common line entity exists. If the layer
determines at step 1008 that a common line entity does not
exist then the process ends. Conversely, if the layer
determines at step 1008 that a common line entity does exist, a
possible root cause of the problem is identified at step 1010.

In a similar manner to the reported root cause analysis, the
correlation of path PASs allows a layer to bypass redundant
processing of the path PASs individually.

In one embodiment, the invention is directed to a
computer system operating as discussed herein. For example,
functions in each layer of network management system 100
are implemented using computer systems. An exemplary
computer system 1102 is shown in FIG. 11. The computer
system 1102 includes one or more processors, such as
processor 1104. The processor 1104 is connected to a
communication bus 1106.

The computer system 1102 also includes a main memory
1108, preferably random access memory (RAM), and a
secondary memory 1110. The secondary memory 1110
includes, for example, a hard disk drive 1112 and/or a
removable storage drive 1114, representing a floppy disk
drive, a magnetic tape drive, a compact disk drive, etc. The
removable storage drive 1114 reads from and/or writes to a
removable storage unit 1116 in a well known manner.

Removable storage unit 1116, also called a program
storage device or a computer program product, represents a
floppy disk, magnetic tape, compact disk, etc. As will be
appreciated, the removable storage unit 1116 includes a
computer usable storage medium having stored therein
computer software and/or data.

Computer programs (also called computer control logic)
are stored in main memory and/or the secondary memory
1110. Such computer programs, when executed, enable the
computer system 1102 to perform the features of the present
invention as discussed herein. In particular, the computer
programs, when executed, enable the processor 1104 to
perform the features of the present invention. Accordingly,
such computer programs represent controllers of the
computer system 1102.

In another embodiment, the invention is directed to a 
computer program product comprising a computer readable 
medium having control logic (computer software) stored
therein. The control logic, when executed by the processor 
1104, causes the processor 1104 to perform the functions of 
the invention as described herein. 

In another embodiment, the invention is implemented 
primarily in hardware using, for example, a hardware state
machine. Implementation of the hardware state machine so 
as to perform the functions described herein will be apparent 
to persons skilled in the relevant art(s). 

While the invention has been particularly shown and 
described with reference to preferred embodiments thereof, 
it will be understood by those skilled in the relevant art that
various changes in form and details may be made therein 
without departing from the spirit and scope of the invention. 

What is claimed is: 

1. A method, in a layer of a network management system,
for identifying a root cause of a problem in a provisioned
channel in a network, the provisioned channel being routed 
through a plurality of network elements, the method com- 
prising the steps of: 

(1) receiving performance monitoring data gathered for
section, line and path entities by a plurality of monitoring
points during a monitoring period, each of said
plurality of monitoring points being associated with a
network element; and

(2) identifying a correlation between any error activity in
said section, line and path entities as indicated by said
performance monitoring data that is received from at
least two different ones of said plurality of monitoring
points to locate a root cause of a problem in either said
section, line or path entities of the provisioned channel.

2. The method of claim 1, wherein said step (2) comprises 
the steps of: 

(a) analyzing performance monitoring data at a first signal
transport level reported from a first monitoring point to
determine whether a potential problem exists;

(b) identifying a second signal transport level that
experiences corresponding error activity, said error activity
being reported by a second monitoring point, wherein
said first signal transport level is mapped into said
second signal transport level; and

(c) identifying a third monitoring point upstream of said
second monitoring point that reports corresponding
error activity.

3. The method of claim 2, wherein said step (b) is repeated
until a highest transport level is identified.

4. The method of claim 2, further comprising the step of
discontinuing further processing for problem alert signals
that are received by the layer in the network management
system from monitoring points downstream from said first
monitoring point, wherein said problem alert signals
correspond to the error activity reported by said first monitoring
point.

5. The method of claim 1, wherein said step (2) comprises 
the steps of: 

(a) analyzing performance monitoring data reported by a
monitoring point associated with a line entity to
determine whether a potential problem exists;

(b) determining whether said line entity includes a
plurality of section entities;

(c) determining whether a monitoring point associated 
with a section entity within said line entity has reported 
corresponding error activity; and 

(d) initiating a problem handling process for said section 
entity, if a monitoring point associated with a section 
entity has reported corresponding error activity. 

6. The method of claim 5, further comprising the step of: 

(e) initiating a problem handling process for said line 
entity, if a monitoring point associated with a section 
entity has not reported corresponding error activity. 

7. The method of claim 5, further comprising a step before 
said step (b) of determining the topology of the line entity. 
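
As a rough illustration of claims 5 through 7, the following sketch decides whether problem handling should start for a section entity or for the line entity itself. The function name, the `sections_of` topology table and the `errored_sections` set are hypothetical. Falling back to the line entity when the line contains only one section is also an assumption of the sketch, since the claims do not address that case.

```python
from typing import Dict, List, Set, Tuple


def handle_line_problem(
    line_id: str,
    sections_of: Dict[str, List[str]],
    errored_sections: Set[str],
) -> Tuple[str, str]:
    """Return (entity kind, entity id) for which problem handling is initiated."""
    # Claim 7: determine the topology of the line entity first.
    sections = sections_of.get(line_id, [])
    # Step (b): does the line entity include a plurality of section entities?
    if len(sections) > 1:
        # Step (c): has a section monitoring point reported corresponding errors?
        for section_id in sections:
            if section_id in errored_sections:
                # Step (d): the section entity is the better root cause candidate.
                return ("section", section_id)
    # Step (e): no section reported errors, so handle the line entity itself.
    return ("line", line_id)
```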

8. The method of claim 1, wherein said step (2) comprises 
the steps of: 

(a) analyzing performance monitoring data reported by a 
monitoring point associated with a path entity to deter- 
mine whether a potential problem exists; 

(b) determining whether said path entity includes a plu- 
rality of line entities; 

(c) determining whether a monitoring point associated 
with a line entity within said path entity has reported
corresponding error activity; and 

(d) initiating a problem handling process for said line 
entity, if a monitoring point associated with a line entity
has reported corresponding error activity. 

9. The method of claim 8, further comprising the step of: 

(e) initiating a problem handling process for said path 
entity, if a monitoring point associated with a line entity 
has not reported corresponding error activity. 

10. The method of claim 9, further comprising a step 
before said step (b) of determining the topology of the path 
entity. 
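
Claims 8 through 10 apply to path and line entities the same pattern that claims 5 through 7 apply to line and section entities. One way to picture the two together is a single drill-down over the containment topology, sketched below. Chaining the two analyses into one recursion is an assumption of this sketch and is not recited in the claims. The `children_of` table and the entity labels are hypothetical, and the plurality check of step (b) is omitted for brevity.

```python
from typing import Dict, List, Set


def localize(entity: str, children_of: Dict[str, List[str]], errored: Set[str]) -> str:
    """Descend to the lowest contained entity that also reported error activity."""
    for child in children_of.get(entity, []):
        if child in errored:
            # A contained entity corroborates the errors, so look inside it.
            return localize(child, children_of, errored)
    # No contained entity reported errors: handle the problem at this entity.
    return entity
```

Called with a path entity, the function returns the path itself when no line entity reported corresponding errors, which matches the fallback of claim 9.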

11. The method of claim 1, wherein said step (2) comprises
the steps of:

(a) analyzing performance monitoring data reported by 
monitoring points associated with a first and a second 
path entity to determine whether a potential problem 
exists; 

(b) determining whether said first and second path entities 
include a common line entity; and 

(c) initiating a problem handling process for said common 
line entity, if a common line entity exists.

12. The method of claim 11, further comprising a step 
before said step (b) of determining the topology of said first 
and second path entities. 
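
The common-line check of claims 11 and 12 amounts to intersecting the line entities traversed by two errored path entities. A minimal sketch follows. The `lines_of_path` topology table and the identifiers in it are hypothetical, and the sketch assumes the topology of both paths has already been determined.

```python
from typing import Dict, List


def common_line_entities(
    lines_of_path: Dict[str, List[str]], path_a: str, path_b: str
) -> List[str]:
    """Return the line entities shared by two path entities, in path_a order."""
    lines_a = lines_of_path.get(path_a, [])
    lines_b = set(lines_of_path.get(path_b, []))
    # Step (b): a line entity carried by both errored paths is a likely culprit.
    return [line for line in lines_a if line in lines_b]
```

A non-empty result would trigger step (c), the problem handling process for the shared line entity. An empty result means the two paths have no line entity in common, so they would be analyzed separately.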

13. A system for identifying a root cause of a problem in 
a provisioned channel in a network, the provisioned channel 
being routed through a plurality of network elements, com- 
prising: 

(1) means for receiving performance monitoring data 
gathered for section, line and path entities from a 
plurality of monitoring points during a monitoring 
period, each of said plurality of monitoring points
being associated with a network element; and 

(2) means for identifying a correlation between any error 
activity in said section, line and path entities as indi- 
cated by said performance monitoring data that is 
received from at least two different ones of said plu- 
rality of monitoring points to identify a root cause of a 
problem in either said section, line or path entities of 
the provisioned channel. 

14. The system of claim 13, wherein said means for 
identifying comprises: 

means for analyzing performance monitoring data at a
first signal transport level reported from a first moni- 
toring point to determine whether a potential problem 
exists; 

means for identifying a second signal transport level that 
experiences corresponding error activity, said error 
activity being reported by a second monitoring point, 
wherein said first signal transport level is mapped into 
said second signal transport level; and 

means for identifying a third monitoring point upstream 
of said second monitoring point that reports corre- 
sponding error activity. 

15. The system of claim 14, further comprising means for 
identifying a highest signal transport level that experiences 
corresponding error activity. 

16. The system of claim 14, further comprising means for 
discontinuing further processing for problem alert signals 
that are received by the layer in the network management 
system from monitoring points downstream from said first 
monitoring point, wherein said problem alert signals corre- 
spond to the error activity reported by said first monitoring 
point. 

17. The system of claim 13, wherein said means for 
identifying comprises: 

means for analyzing performance monitoring data 
reported by a monitoring point associated with a line 
entity to determine whether a potential problem exists; 

means for determining whether said line entity includes a 
plurality of section entities; 

means for determining whether a monitoring point asso- 
ciated with a section entity within said line entity has 
reported corresponding error activity; and 

means for initiating a problem handling process for said 
section entity, if a monitoring point associated with a 
section entity has reported corresponding error activity. 

18. The system of claim 17, further comprising: 

means for initiating a problem handling process for said
line entity, if a monitoring point associated with a
section entity has not reported corresponding error 
activity. 

19. The system of claim 17, further comprising means for 
determining the topology of the line entity. 

20. The system of claim 13, wherein said means for 
identifying comprises: 

means for analyzing performance monitoring data 
reported by a monitoring point associated with a path 
entity to determine whether a potential problem exists; 

means for determining whether said path entity includes
a plurality of line entities; 

means for determining whether a monitoring point associated
with a line entity within said path entity has
reported corresponding error activity; and 

means for initiating a problem handling process for said 
line entity, if a monitoring point associated with a line
entity has reported corresponding error activity. 

21. The system of claim 20, further comprising: 

means for initiating a problem handling process for said
path entity, if a monitoring point associated with a line
entity has not reported corresponding error activity. 

22. The system of claim 21, further comprising means for 
determining the topology of the path entity. 

23. The system of claim 13, wherein said means for 
identifying comprises: 

means for analyzing performance monitoring data 
reported by monitoring points associated with a first 
and a second path entity to determine whether a poten- 
tial problem exists; 

means for determining whether said first and second path 
entities include a common line entity; and 

means for initiating a problem handling process for said
common line entity, if a common line entity exists. 

24. The system of claim 23, further comprising means for 
determining the topology of said first and second path 
entities. 

25. A computer program product, comprising: 

a computer usable system medium having computer read- 
able program code means embodied in said medium 
that identifies a root cause of a problem in a provisioned 
channel in a network, the provisioned channel being 
routed through a plurality of network elements, said 
computer readable program code means comprising: 

first computer readable program code means for causing 
a computer to effect a reception of performance moni- 
toring data gathered for section, line and path entities 
from a plurality of monitoring points during a moni- 
toring period, each of said plurality of monitoring 
points being associated with a network element; and 

second computer readable program code means for caus- 
ing a computer to effect an identification of a correla- 
tion between any error activity in said section, line and 
path entities as indicated by said performance moni- 
toring data that is received from at least two different 
ones of said plurality of monitoring points to identify a 
root cause of a problem in either said section, line or 
path entities of the provisioned channel. 